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Objectives 

We will demonstrate an innovatory system for the sequencing of nucleic acids on a single 
semiconductor chip* This technology combines four key technologies already devised, 
developed and led by members of our collaboration. We fit squarely within biotech area 2.' 
IjDiaGene demonstrates the synergistic power of our combining our individual technologies: 

* intrinsic absorption imaging and sequencing of nucleic adds- an unconventional unlabeled 
approach -pioneered and demonstrated in the Imperial College (IQ physics dept.; 

• the methods of analysis on a chip as pioneered and demonstrated at IC Centre for 
Analytical Sciences under in [iTAS (Total Analysis Systems) programme; 

• the development of UV-sensitised silicon pixel detectors, first applied in biomedical 
imaging and taken to its highest level of development at University of Cagliari (Cag) and 

* the use of microridge sensor topologies (devised at IC) in Chemical Vapour Deposition 
Diamond, among whose leading proponents is our subcoVitractor the University of Lodz, 

*» Our team includes the foremost company involved in the manipulation of narto liter 

quantities of biological materials,, Evotec GmbH., who also provide a user-driven edge. 
We will demonstrate a novel sequencing system based on the intrinsic absorption in nucleic 
acids of ultraviolet (UV) light The demonstration step will comprise our producing nucleic 
add sequences in four research laboratories, all well known as specialists for different 
sequences. Comparison at the single base-pair (Bp) level will be undertaken in which the same 
chip is operated both in the conventional mode using chcrnilumiriescent labels and using our 
absorption technique- Naturally, the same sequence sample will also be checked on a 
conventional commeroal system to provide extra dhuecktog redujadandes. The fiDi^G^c 
system is based on the MIDIA-approach to molecular imaging for nucleic add and protein 
sequencing and mapping, a technique developed at IC over the last three years and in the 
process of being comrnerdalisecL Its advantages are: 

• the complete removal of mutagenic, toxic and carcinogenic chemi-, biciiiminescent and 
radiolabeis with associated benefits in running costs, health and safety, operational and 
framing and disposal imperatives; 

an order of magnitude speeding of a typical sequence: a 6 hour (say) 500 Bp sequence 
electrophoresis can be achieved in 35 minutes in MEDIA and perhaps 10 minutes in 
lilHaGeruz. This saving in time comes because the image of bands identified by 
conventional labels is smeared by the isotropic emission of those labels (whether a 
radioactive label, or a laser-induced fluorescent detection (CE-LEF) system) while in the 
absorption technique we gain a factor of ten since the image of the nucleic acid band is the 
same size as the band (smeared slightly by diffraction). This can be made very small with 
jiXAS technology. This is a new paradigm in seqttcndng. 

* improved sensitivities and signal-to-noise in sub-microliter sequencing and imaging. 

Our specific goal is to demonstrate these existing macroscopic technologies - developed in the 
last two years and in the process of being commercialised - at the chip level. fiDiaGenc can in 
prindple also accept conventionally labelled nucleic acids to allow ht situ cross checking. Our 
target sequences include: Arabidapsis, Lactococus Lactis, HIV, and an analysis of the spectrum of 
mutations in Cystic Fibrosis, X-Linked Adrenal Hypoplasia, and Crigler-Lajjar Syndrome- 

The objective of this project is to demonstrate economic, advanced and automated high 
throughput megabase DNA sequencing technologies, based on no^el nucleic add detection 
systems, while minimising, perhaps to zero, any environmental or health and safety risks. 
Results are intrinsically digital, and given the exceptional sensitivities, rapid tum-around 
possible and high throughputs attainable, the fjDiaGene technologies are ideally suited to the 
Human Genome research networks. Since the fiDiaGene system allows pre-programmed 
sequence lecognition masks to accompany any installation, in prindple a "yes-no" answer 
could be obtained by relatively untrained end-users. A sequence triage system results, which 
coupled to the extremely low cost of this technology, can result in the wide dissemination of 
fiCHaGcne systems, to surgeries, schools and individual researchers' desk-tops. 



2 Work Content 
2*0 Introduction 

The current technologies use photographic techniques to image radioactive or lurrtinescent^^^ 
attached to the nucleic acids - a time-cxrnsuming procedure, or phosphor imagers to record the 
sequences - an expensive alternative. Both processes use hazardous chemicals to tag the nucleic 
adds and the safe use and disposal of these is a major problem, and both require skilful scientific 
and technical input. Both are relatively slow. In any emission technique (eg. CE-UT) the tim£-to- 
sequence (TTS) depends on the separation gradient (the electric field)and discriminator power (e.g. the 
capillary or electrophoresis gel) convolved with the size of the objecis to be separated. The key ingredient 
in {iDiaGene is the benefit miniaturisation brings. Reducing imaging dimensions by a factor 10 
decreases time-to-sequence (TTS) by 100. By using intrinsic absorption, we reduce greatly the 
strong dependence on sample volume present in conventional emission techniques. 

liDtaGene will transform the results of four distinct research programmes into a working 
technology, suitable for commercialisation. We wish to emphasise that, while uDiaGcne is 
defixuiely pre-coropetitrve, our technology is such that the commerdalisaticn process can 
relatively fast. fiDiaGeiie has substituted the technology of hardware, moving parts, chemistry ^j\d 
biological processes to one involving rapidly reproducible chip technology and software. jxDiaGene 
will excel in an area which is a known to be a huge market, but improve upon existing technologies 
in every respect The only area in which fjDiaGene is at a nominal disadvantage compared with 
conventional technologies - handling nanoliters of biomaterials - is one in which two partners (IC 
and Evotec) have specialised and which they have solved. Our lean management structure and 
focused pa rtn er s hip has redundancy only where unforeseen circumstance could slow the rapid 
prototype development. The target the fiDiaGene sequencer chip - is well-defined. 

2.1 The MIDIA programme. 

The MIDIA system has been shown to work using electrophoresis geis: polyacrylimide for 
sequencing and agarose for restriction enzyme maps. The technique is simple: DNA bands are 
separated in polyacrylimide get but are imaged not through radioactive emission or photon 
emission following UV laser activation (CE-LXF), but through the observation of a given band 
through intrinsic absorption: we make multiple images of the band shadows and build up a 
picture of their passage, sorting them into velocity (rather than spatial bands). 

Figure 1: The side view of MIDIA technology. 




spatial domain velocity domain 

The time-development of the separating bands of PCR marker can be seen in figure 2. Here bands 
have developed over time and as they progress along the polyacrylimide geL By grouping signals 
in velocity space, we obtain the sequence. The figure shows a PCR ladder with bands 
corresponding to 10,20,50,100,200,500 and 700 base pairs with approximately 2ng/mL in each 
band. The plot shows how the MEDIA system sorts DNA clearly into a sequence. Nota: Since 
taking this image, we have improved the signal to noise by about 100, of which a factor 50 is 
necessary to go to Ibp resolntion where the bands are broadened, out at 700bp. 
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2.2 The fiDiaGene programme 

The objectives of fsDiaGene are to demonstrate: 
this technology on a diamond/silicon chip 
approximately 20 x 20 mm. A schematic is 
shown in figure 4 with the side view 
resembling figure 1. To prove this 
technology we will sequence a range of 
important biomoleculea at our 
biotechnology laboratories in Cambridge, 
Imperial College Hamburg and Cagliari- 
This pan-Europe approach will allow a 
rapid and robust benchmarking and aid 
greatly in our ultimate technology 
dissemination. 
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Figure 5 shows a scanning electron 
microscope image of a 20 [mv ridged CVD 
diamond before laser cleaning Two 
channels (the 5th and 6th from the rirfit) 
have coalesced due to an error in the laser 
ablation process. This mistake was the only 
one found in this wafer's etching process 
The image covers some 400 \uxy x 370jun. 
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Figure 6: The channels are extremely 
cleanly etched: these are 20a wide 




233 Silicon^nxet detector These have been 
developed to their highest level by Cagliari, 
who have used them in a range of 
biotechnological and medical applications, 
A silicon pixel device is better than a CCD 
here since it can be made much more 
cheaply in large quantities and does not 
suffer from the saturation a CCD could 
experience in this application. The pixels 
are made to match in one direction the size 
of the troughs - nominally 200 Jim - and in 
theother dimension the size of the bands in 



Figure 8: A X-ray image of a mouse in 
the Cagliari pixel detector with ptDiaGenc 



the direction of their travel, which we take 
to be 150 mxu The pixels work equally w*U 
(with quantum efficiencies of over 90% and 
higjtly linear in response) in the UV as the 
X-ray region. The demonstration will also 
be made with UV-sensitised silicon 
photodiode arrays. However, this is for 
cross checks only: the pixel array is 
essential to get the channel density required 
on the wafer. The maximum density of 
sequencing lanes possible in the latter 
technology is 4 per mm; we intend to reach 
at least 200 channels in 20mm wide wafer. 

Figure 7 Pixel layout and interconnects 
in a Cagliari pixel detector xvith fjJOiaCene 
topology. 




topologies and readout This image has 
170 x 170pm resolution and 16 bit 
intensity. 

2.3.4 pTotal Analysis System development: 
The lead of Imperial College in the 
academic arena and Evotec Gmbfi's in the 
industrial arena in this area has over the 
last 5 years increased and been reinforcetL 
The Manz group at IC will work closely 
with the Evotec GmbH team assigned to 
liDiaGene on all aspects of liquid handling 
and ancillary (non-optical, non-readout) 
instrumentation. The integration of the 
detection system and a multipurpose fluid 
interface to the macroworld onto the chip, 
has led to the integrated microsystem, with 
fluidic, optic and electronic components. 
We identify individual tasks as being; 
liquid handing and automation^ mechanical 
rnicrcmanrpulators r microdispenser syringe 
development, liquid electrostatic gating 
system and wafer re-use technology. The 
rnicromarupulation technological problems 
are demanding but well within the reach of 



es 



our collaboration. As is shown in figure 4, the mkrochannels which act as capillaries con 
just 150 fxm deep, and 120^m wide and yet, as is shown in figure 3, achieve a S/N of 
(2ng/p.L DNA). The channels will be some 18mm long, in a 20 x 20 mm wafer. The H _ 
handling is made easier by the ridges opening apertures: They are enlarged at their entrances 
make a fan-in- This is possible since between the microridgc "troughs" we have an extra 100 
\im ridge top "deadspace"* The demonstration system must therefore accept the nucleic acid 
mixture through a microsyringe system capable of delivering (say) 120 x 150 x 150 pm liquid = 
1.5 x 10"* 1 liquid into groups of five receptors 250jim across* Initially, we need only run only 
one group of 5 lanes (one for each termination and one calibration). The 18mm wide wafer can 
take some 3600 troughs in principles we are not likely to want to attempt that in the 
Demonstration phase of this project, but any fluid handling system will be designed with the 
potential in mind: 

Figure 9: /iTAS sequencing 



QLS-i 



i 




We will require : 

multiple liquid input with 10% accuracies 
in quantity and 10 jim precisions in 3 
dimensions. Evotec GmbH are experts in 
micro-separation methods/ fluid handling 
etc. and have worked with IC for many 
years. The power of the fiTAS approach is 
shown in the figure 9. This shows an 
elecrtropherogram of a fluoresoein4abeled 
phosphorothioate oligonucleotide mixture 
PS pd(T)10-25 mixture recorded at 1130 
V/cm in a 10% T non-cross-linked 
polyacrylimide matrix, with chain length* 
tentatively assigned, but dearly capable of 
lbp separation- In Emission mode (as 
shown above), we sequence within a few 
minutes. It is possible we can reduce this in 
absorption mode. 



2.3.5 Readout and data acquisition: we have designed and used a range of pixel 
readout systems. 

2.3.6 Analysis and Interpretation: The pattern recognition will be adapted from the 
existing programs which have worked for macroscopic MEDIA. Note: {iDiaGcne 
output is intrinsically digital, and therefore can be collected and analysed remotely If 
necessary. However, we wish also to develop software mask?, which can be 
supplied along with the hardware systems, which will allow inexpert use of the 
system and gain a first-pass assessment of a given sequence. Clearly, this approach 
will require extensive testing, and we have assigned a subcontractor, IDFA 
Cambridge, to coordinate this side of the exploitation. 

2.3.7 Demonstration and Benchmarking: Our collaboration includes sequencing groups 
from Imperial College, Cagliari and Cambridge. IDFA Cambridge is a spin-off from 



the MIDIA approach. They will decide upon and coordinate a concerted 
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